Rank in Wordlist | Frequency | Word |
---|---|---|
186 | 74 | хатын-кыз |
1111 | 13 | ир-ат |
1346 | 11 | тар-мар |
1584 | 9 | бер-бер |
1681 | 9 | хатын-кызлар |
1799 | 8 | азык-төлек |
2054 | 7 | Санкт-Петербург |
2136 | 7 | капма-каршы |
2228 | 7 | турыдан-туры |
2572 | 6 | мг-экв/л |
2741 | 5 | 10-15 |
2857 | 5 | Нью-Йорк |
2942 | 5 | бер-берсен |
2943 | 5 | бер-берсенә |
3201 | 5 | тирә-як |
3374 | 4 | 3-нче |
3750 | 4 | кара-каршы |
3800 | 4 | кулга-кул |
3922 | 4 | савыт-саба |
4049 | 4 | уку-укыту |
Rank in Wordlist | Frequency | Word |
---|---|---|
7381 | 2 | Яда-Яходы-Яха |
10322 | 1 | 0-0-0 |
10323 | 1 | 0-631-20814-3 |
12610 | 1 | Ай-Кур-Ех |
12611 | 1 | Ай-Моккун-Ях |
12613 | 1 | Ай-Суны-Еган |
12817 | 1 | Арка-Есета-Яха |
12818 | 1 | Арка-Тутысыма-Яха |
13236 | 1 | Бисмилләәһир-рахмәәнир-рахиим |
13501 | 1 | Варка-Сыль-Кы |
Rank in Wordlist | Frequency | Word |
---|---|---|
10323 | 1 | 0-631-20814-3 |
15261 | 1 | Кутлон-Ай-Кур-Ех |
17400 | 1 | ТПН-1-22-11 |
31669 | 1 | әт-Тарих-ул-Кәбир |
Some languages allow the formation of longer word by composition using hyphens. Moreover, proper names may contain hyphens. Therefore we look for the most frequent words containing 1, 2, 3 or 4 hyphens.
Usually we find interesting words. But in the case of poor preprocessing there may be unexpected strings resulting from hyphenation etc. Words ending with an hyphen are usually not welcome, too.
For three hyphens:
select w_id-100,freq, word from words where word like "%-%-%-%" limit 10;
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots
3.12.4 Words containing special characters